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ACM SIGMOD Record , Proceedings of the 1997 ACM SIGMOD international 
conference on Management of data June 1997 
Volume 26 Issue 2 

Li many applications, new data is being generated every day. Often an index of the data of a 
past window of days is required to answer queries efficiently. For example, in a warehouse 
one may need an index on the sales records of the last week for efficient data mining, or in a 
Web service one may provide an index of Netnews articles of the past month. In this paper, 
we propose a variety of wave indices where the data of a new day can be efficiently added, 
and old data can ... 
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Proceedings of the 19th annual international ACM SIGIR conference on Research and 
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Proceedings of the 18th annual international ACM SIGIR conference on Research and 
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1 hiformation retrieval 2: Dynamic maintenance of web indexes using landmarks 

Lipyeow Lim , Min Wang , Sriram Padmanabhan , Jeffrey Scott Vitter , Ramesh Agarwal 
Proceedings of the twelfth international conference on World Wide Web May 2003 
Recent work on incremental crawling has enabled the indexed document collection of a 
search engine to be more synchronized with the changing World Wide Web. However, this 
synchronized collection is not immediately searchable, because the keyword index is rebuilt 
from scratch less frequently than the collection can be refreshed. An inverted index is usually 
used to index documents crawled from the web. Complete index rebuild at high fi-equency is 
expensive. Previous work on incremental inverted in ... 
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Volume 23 Issue 2 

With the proUferation of the world's “information highways” a renewed 
interest in efficient document indexing techniques has come about. In this paper, the problem 
of incremental updates of inverted lists is addressed using a new dual-structure index. The 
index dynamically separates long and short inverted lists and optimizes retrieval, update, and 
storage of each type of list. To study the behavior of the index, a space of engineering 
trade-offs which range from optimizing upd ... 
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